We are interested in analyzing the effect of bandwidth sharing for telecommunication networks. More precisely, we want to calculate which routes are bottlenecks by means of large deviations techniques. The method is illustrated in this paper on a star network, where the bandwidth is shared between customers according to the so-called min policy. We prove a sample path large deviation principle for a rescaled process $n^{-1}Q_{nt}$, where $Q_t$ represents the joint number of connections at time $t$. The main result is to compute the rate function explicitly. The major step consists in deriving large deviation bounds for an empirical generator constructed from the joint number of customers and arrivals on each route. The rest of the analysis relies on a suitable change of measure together with a localization procedure. An example shows how this can be used practically.



1. Introduction. 



The model. Consider a star-shaped network (see Figure 1) consisting of $N$ links, each connected to the other $N-1$ through a central hub: there are $N(N-1)/2$ routes of length two. In the sequel the set of links (or channels) is denoted by $S=\{1,\dots,N\}$, whereas the set of routes is simply the set of unordered pairs $ij$, $i,j\in S$: for the sake of simplicity, we do not distinguish between $ij$ and $ji$ (i.e., we consider nonoriented routes), but there is no additional difficulty in handling oriented routes. Denote by $q_{ij}(t)$ [resp. $q_i(t)$] the number of calls (or connections) on route $ij$ (resp. the number of calls involving channel $i$) at time $t$. Each link $i$ has a capacity (or bandwidth) equal to $C_i$ (expressed, e.g., in bits per second in the context of communication networks). Note that $q_i(t)=\sum_j q_{ij}(t)$. Then $Q(t,x)=(q_{ij}(t),\ i,j\in S)$ represents the state of the network at time $t$ when it starts initially from state $x$. For the sake of simplicity, we shall sometimes omit $x$ or $t$ when they do not play a role.

F. DELCOIGNE AND A. DE LA FORTELLE

Documents to be transferred arrive on route $ij$ according to a Poisson process of rate $\lambda_{ij}$. We shall denote by $\mathcal{R}$ the set of active routes, that is, those with $\lambda_{ij}>0$. The size of a document (expressed in bits) on route $ij$ is supposed to be exponentially distributed with parameter $\mu_{ij}$. Each document on route $ij$ is allocated a portion $\nu_{ij}(x)/x_{ij}$ of the bandwidth when the state of the network is $x$. Hence, documents on route $ij$ are transferred at total rate $\mu_{ij}\nu_{ij}(x)$. There are several ways to allocate a fair proportion of the bandwidth to customers. A classical one is to choose the coefficients $\nu_{ij}(x)$ according to the max-min fairness allocation.

The star network is proposed as a model for a router where the bandwidth 
is shared fairly between the different connections. However, the max-min 
fairness allocation is not explicit and hard to analyze at first. In order to get 
a more tractable model, we focus on the min policy. 



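\[
\nu_{ij}(x) \stackrel{\text{def}}{=}
\begin{cases}
x_{ij}\left(\dfrac{C_i}{x_i}\wedge\dfrac{C_j}{x_j}\right), & \text{if } x_{ij}>0,\\[4pt]
0, & \text{otherwise,}
\end{cases}
\]
where $x_i=\sum_j x_{ij}$ is the number of connections using channel $i$.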
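The min policy can be sketched in a few lines of Python. This is only an illustration: the function names, the dictionary representation of the state and the numerical values are assumptions made for the example, not part of the paper. The sketch computes $\nu_{ij}(x)=x_{ij}(C_i/x_i\wedge C_j/x_j)$ for every route.

```python
from typing import Dict, Tuple

Route = Tuple[int, int]  # unordered route (i, j), stored with i < j


def channel_loads(x: Dict[Route, int]) -> Dict[int, int]:
    """Return x_i = sum_j x_ij, the number of connections using channel i."""
    loads: Dict[int, int] = {}
    for (i, j), n in x.items():
        loads[i] = loads.get(i, 0) + n
        loads[j] = loads.get(j, 0) + n
    return loads


def min_policy(x: Dict[Route, int], capacity: Dict[int, float]) -> Dict[Route, float]:
    """Bandwidth nu_ij(x) = x_ij * min(C_i / x_i, C_j / x_j), and 0 on empty routes."""
    loads = channel_loads(x)
    nu: Dict[Route, float] = {}
    for (i, j), n in x.items():
        if n == 0:
            nu[(i, j)] = 0.0
        else:
            nu[(i, j)] = n * min(capacity[i] / loads[i], capacity[j] / loads[j])
    return nu


# Illustrative state on three channels (values chosen for the example only).
capacity = {1: 3.0, 2: 2.0, 3: 1.0}
state = {(1, 2): 2, (2, 3): 2, (1, 3): 0}
allocation = min_policy(state, capacity)
```

Since each route receives at most the share $x_{ij}C_i/x_i$ of channel $i$, the total bandwidth used on a channel never exceeds its capacity; in the example, channel 2 is exactly saturated.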

It has been shown in Fayolle, de La Fortelle, Lasgouttes, Massoulié and Roberts (2001) that the system under the max-min fairness allocation is stochastically smaller than the one with the min policy and that the network is ergodic if, and only if,
\[
\sum_{j}\frac{\lambda_{ij}}{\mu_{ij}} < C_i \qquad \forall i\in S. \tag{1.1}
\]
However, it appears very difficult to compute quantities of interest like the mean transfer time of a document, so we turn to asymptotic analysis, especially large deviations.
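Condition (1.1) is easy to check numerically. The following sketch is an illustration only: the helper names are hypothetical, and the parameter values are those read from the caption of Figure 4, with the arrival rate on route 13 left as a free parameter `x`.

```python
from typing import Dict, Tuple

Route = Tuple[int, int]


def channel_intensity(lam: Dict[Route, float], mu: Dict[Route, float]) -> Dict[int, float]:
    """Return rho_i = sum_j lambda_ij / mu_ij for every channel i."""
    rho: Dict[int, float] = {}
    for (i, j), rate in lam.items():
        load = rate / mu[(i, j)]
        rho[i] = rho.get(i, 0.0) + load
        rho[j] = rho.get(j, 0.0) + load
    return rho


def is_ergodic(lam: Dict[Route, float], mu: Dict[Route, float],
               capacity: Dict[int, float]) -> bool:
    """Check condition (1.1): rho_i < C_i for every channel i."""
    rho = channel_intensity(lam, mu)
    return all(rho[i] < capacity[i] for i in rho)


def example_network(x: float):
    """Parameters as read from the caption of Figure 4; x is the rate on route 13."""
    capacity = {1: 3.0, 2: 2.0, 3: 1.0}
    lam = {(1, 2): 1.0, (2, 3): 1.0, (1, 3): x}
    mu = {(1, 2): 1.0, (2, 3): 2.0, (1, 3): 1.0}
    return lam, mu, capacity
```

With these values the binding constraint is the one of channel 3, namely $1/2 + x < 1$.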



Fig. 1. The (asymmetric) star network. [Figure labels: bandwidth, channel, connection.]
Previous work. A great deal of work has been devoted to the analysis of telecommunication networks. The model (star network and min policy) is described within the telecommunication context in Fayolle, de La Fortelle, Lasgouttes, Massoulié and Roberts (2001). In the present paper we aim at deriving a sample path LDP for the rescaled process
\[
Q^n_x \stackrel{\text{def}}{=} \left\{\frac{1}{n}\,Q(nt,[nx]),\ t\ge 0\right\}.
\]
Our main concern is to identify the rate function explicitly. This is a preliminary step in order to obtain large deviation bounds in the stationary regime. This issue is discussed in Section 2.

The major difficulty comes from the fact that the coefficients of the generator are not spatially continuous [the service rate $\mu_{ij}(x)$]. It seems that one of the first papers dealing with large deviations for processes with discontinuous statistics is Dupuis, Ishii and Soner (1990), where the case of Jackson networks was investigated using partial differential equations techniques. In Dupuis and Ellis (1995) a sample path LDP is proved for a wide class of jump Markov processes with discontinuous statistics. However, the methodology of proof uses subadditivity arguments and the rate function is not identified; moreover, there is a uniform reachability condition that our model does not fit. The identification of the rate function in this general framework is still an open problem when the dimension of the network is arbitrary. General results were obtained in Dupuis and Ellis (1992) and Ignatyuk, Malyshev and Shcherbakov (1994), where the LDP has been established. Nevertheless, in such examples, there are at most two boundaries with codimension one or two where discontinuity arises. Using special features of the models and the fact that fluid limits could be completely identified, this program was carried out, for example, in Atar and Dupuis (1999), Ignatiouk-Robert (2000) and in Delcoigne and de La Fortelle (2002).

Structure of the paper. An example (Section 2) shows how the expression of the rate function can be used to compute the decay rate for the tails of the stationary distribution. In Section 3 we introduce the central notions of localized model and of empirical generator; the rate functions (local and global) are studied. In Section 4 the local LDP is proved by means of a change of measure and the identification of the local rate function is worked out for ergodic networks. In Section 5 the sample path LDP is stated. In Section 6 we get rid of the ergodicity assumption: in our opinion this is the main contribution of the present paper since the methodology used allows a treatment of more complex and realistic protocols like max-min-fair. This issue will be investigated in a forthcoming paper.
Fig. 2. Representation of a star-shaped network: lines symbolize routes using two channels (circles at the ends of the lines). The routes are partitioned into saturated routes ($\Lambda$), jammed routes ($\Lambda_1$), on which the service rate is null since all channels belonging to $\Lambda$ are saturated, and ergodic routes ($\Lambda_2$). [Figure legend: saturated channel; ergodic channel; active routes: $\Lambda$, $\Lambda_2$; jammed routes: $\Lambda_1$.]



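Notation. In our setting, $\{Q(t,x_0),\ t\ge 0\}$ is a Markov process with generator $R$ such that, for all bounded real functions $f$ on $\mathbb{Z}_+^{\mathcal R}$,
\[
Rf(x)=\sum_{y\in\mathbb{Z}_+^{\mathcal R}} q(x,y)\,\bigl(f(y)-f(x)\bigr) \qquad \forall x\in\mathbb{Z}_+^{\mathcal R},
\]
where
\[
q(x,y) \stackrel{\text{def}}{=}
\begin{cases}
\lambda_{ij}, & \text{if } y-x=e_{ij},\\[4pt]
\mu_{ij}(x) \stackrel{\text{def}}{=} \mu_{ij}\,x_{ij}\left(\dfrac{C_i}{x_i}\wedge\dfrac{C_j}{x_j}\right), & \text{if } y-x=-e_{ij},\\[4pt]
0, & \text{otherwise,}
\end{cases}
\]
using the convention that $0/0=0$ (i.e., when $x_{ij}=0$).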

• For any set $A$, $A^c$ will denote its complement and $1_{\{A\}}$ its indicator function;

• $D([0,T],\mathbb{R}_+^{\mathcal R})$ is the space of right continuous functions $f:[0,T]\to\mathbb{R}_+^{\mathcal R}$ with left limits, endowed with the Skorokhod metric denoted by $d_d$;

• $C([0,T],\mathbb{R}_+^{\mathcal R})$ is the space of continuous functions equipped with the metric of the uniform convergence denoted by $d_c$.
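The jump rates $q(x,y)$ of the generator translate directly into a simulation of the process. The sketch below is a standard Gillespie-type simulation written for illustration; the function names and the dictionary representation of the state are assumptions, and nothing here is taken from the simulations reported in the paper.

```python
import random
from typing import Dict, Tuple

Route = Tuple[int, int]


def service_rates(x: Dict[Route, int], mu: Dict[Route, float],
                  capacity: Dict[int, float]) -> Dict[Route, float]:
    """Departure rates mu_ij(x) = mu_ij * x_ij * min(C_i/x_i, C_j/x_j), 0 on empty routes."""
    loads: Dict[int, int] = {}
    for (i, j), n in x.items():
        loads[i] = loads.get(i, 0) + n
        loads[j] = loads.get(j, 0) + n
    rates: Dict[Route, float] = {}
    for (i, j), n in x.items():
        if n > 0:
            rates[(i, j)] = mu[(i, j)] * n * min(capacity[i] / loads[i],
                                                 capacity[j] / loads[j])
        else:
            rates[(i, j)] = 0.0
    return rates


def simulate(x0: Dict[Route, int], lam: Dict[Route, float], mu: Dict[Route, float],
             capacity: Dict[int, float], horizon: float,
             rng: random.Random) -> Dict[Route, int]:
    """Simulate the jump process up to time `horizon` and return the final state."""
    x = dict(x0)
    t = 0.0
    while True:
        dep = service_rates(x, mu, capacity)
        total = sum(lam.values()) + sum(dep.values())
        t += rng.expovariate(total)  # exponential holding time
        if t > horizon:
            return x
        u = rng.random() * total     # pick the transition proportionally to its rate
        for route, rate in lam.items():
            if u < rate:
                x[route] += 1        # arrival: y - x = e_ij
                break
            u -= rate
        else:
            for route, rate in dep.items():
                if u < rate:
                    x[route] -= 1    # departure: y - x = -e_ij
                    break
                u -= rate
```

A route with zero departure rate can never be selected for a departure, so the state stays in $\mathbb{Z}_+^{\mathcal R}$. The sketch assumes that at least one arrival rate is positive, so that the total rate never vanishes.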



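Definition 1.1 (Face). For $x\in\mathbb{R}_+^{\mathcal R}$, the face $\Lambda(x)$ is defined by
\[
\Lambda(x) \stackrel{\text{def}}{=} \{ij\in\mathcal{R}: x_{ij}>0\}.
\]
By an abuse of notation, we will also call face $\Lambda$ the set
\[
\{y\in\mathbb{R}_+^{\mathcal R}: y_{ij}>0\ \ \forall ij\in\Lambda, \text{ and } y_{ij}=0\ \ \forall ij\in\Lambda^c\}. \tag{1.2}
\]
A partition of the routes (see Figure 2) is defined by $\Lambda$ and
\[
\Lambda_1 \stackrel{\text{def}}{=} \{ij\in\Lambda^c: \exists k\in S,\ ik\in\Lambda \text{ or } jk\in\Lambda\},
\]
\[
\Lambda_2 \stackrel{\text{def}}{=} \{ij\in\Lambda^c: \forall k\in S,\ ik\notin\Lambda \text{ and } jk\notin\Lambda\}.
\]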
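The partition of the routes into the face and the sets $\Lambda_1$, $\Lambda_2$ can be computed mechanically from a state. The following sketch is illustrative only: the function name and the representation of a state as a dictionary over the active routes are assumptions.

```python
from typing import Dict, Set, Tuple

Route = Tuple[int, int]


def partition(x: Dict[Route, float]) -> Tuple[Set[Route], Set[Route], Set[Route]]:
    """Return (Lambda, Lambda_1, Lambda_2) for a state x indexed by the active routes."""
    face = {route for route, n in x.items() if n > 0}
    # Channels used by at least one route of the face.
    busy = {channel for route in face for channel in route}
    rest = set(x) - face
    # Routes outside the face that share a channel with a route of the face.
    lambda_1 = {route for route in rest if route[0] in busy or route[1] in busy}
    lambda_2 = rest - lambda_1
    return face, lambda_1, lambda_2
```

For instance, on a network with four channels where only route 12 is nonempty, the routes 13, 14, 23 and 24 fall in $\Lambda_1$, while route 34 is the only element of $\Lambda_2$.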
Fig. 3. Structure of the local linear bounds of Theorem 1.2. $L(x,D)$ is the cost per unit time for the path $Q(t,y)$ (starting near $nx$) to stay in the neighborhood of $nx+Dt$ over the time $t\in[0,n\tau]$.



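The vector space relative to $\Lambda$ is defined by $\mathbb{R}^{\Lambda} \stackrel{\text{def}}{=} \{D\in\mathbb{R}^{\mathcal R}: D_{ij}=0\ \ \forall ij\in\Lambda^c\}$.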



Results. For ergodic networks, our main result is the local linear large 
deviation bounds of Theorem 1.2. The notation is explained by Figure 3. 



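Theorem 1.2. Assume that $Q$ is ergodic and let $x\in\mathbb{R}_+^{\mathcal R}$ and $D\in\mathbb{R}^{\Lambda(x)}$. Then, writing $\lim_{\tau,\delta,\varepsilon\to0}$ for $\lim_{\tau\to0}\lim_{\delta\to0}\lim_{\varepsilon\to0}$,
\[
\begin{aligned}
&\lim_{\tau,\delta,\varepsilon\to0}\ \inf_{|y-nx|<\varepsilon n}\ \liminf_{n\to\infty}\ \frac{1}{n\tau}\log\mathbb{P}\Bigl[\sup_{t\in[0,n\tau]}|Q(t,y)-nx-Dt|<\delta n\Bigr]\\
&\qquad=\lim_{\tau,\delta,\varepsilon\to0}\ \sup_{|y-nx|<\varepsilon n}\ \limsup_{n\to\infty}\ \frac{1}{n\tau}\log\mathbb{P}\Bigl[\sup_{t\in[0,n\tau]}|Q(t,y)-nx-Dt|<\delta n\Bigr].
\end{aligned}
\tag{1.3}
\]
Moreover, if $\Lambda$ and the drift $D\in\mathbb{R}^{\Lambda}$ are fixed, the preceding limit in $\tau$ is uniform w.r.t. $x$ in compact sets of $\Lambda$ (see Definition 1.1). The common value of these limits is denoted by $-L(x,D)$ and
\[
L(x,D)=\sum_{ij\in\Lambda(x)\cup\Lambda_1(x)} l\bigl(D_{ij}\,\|\,\lambda_{ij},\mu_{ij}(x)\bigr), \tag{1.4}
\]
where
\[
l(D\,\|\,\lambda,\mu) \stackrel{\text{def}}{=} D\log\left(\frac{D+\sqrt{D^2+4\lambda\mu}}{2\lambda}\right)+\lambda+\mu-\sqrt{D^2+4\lambda\mu} \tag{1.5}
\]
stands for the cost that a suitably normalized M/M/1 queue with parameters $\lambda$ and $\mu$, starting far from the origin, follows the drift $D$ [see, e.g., Shwartz and Weiss (1995)].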
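Formulas (1.4) and (1.5) can be evaluated directly. The sketch below is an illustration under stated assumptions: the function names are hypothetical, the service rates $\mu_{ij}(x)$ and the index set $\Lambda(x)\cup\Lambda_1(x)$ are supplied by the caller, and the cost is taken to be $+\infty$ when a queue without service is asked to decrease.

```python
import math
from typing import Dict, Iterable, Tuple

Route = Tuple[int, int]


def local_cost(d: float, lam: float, mu: float) -> float:
    """l(D || lambda, mu) from (1.5), for lam > 0 and mu >= 0."""
    root = math.sqrt(d * d + 4.0 * lam * mu)
    if d == 0.0:
        return lam + mu - root  # convention 0 * log(.) = 0
    # (d + root) / (2 lam), rewritten as 2 mu / (root - d) to avoid cancellation when d < 0
    ratio = (d + root) / (2.0 * lam) if d > 0.0 else 2.0 * mu / (root - d)
    if ratio == 0.0:
        return math.inf  # mu = 0 and d < 0: the queue cannot decrease
    return d * math.log(ratio) + lam + mu - root


def local_rate(drift: Dict[Route, float], lam: Dict[Route, float],
               mu_x: Dict[Route, float], routes: Iterable[Route]) -> float:
    """L(x, D) from (1.4): sum of l over the routes of Lambda(x) and Lambda_1(x)."""
    return sum(local_cost(drift[r], lam[r], mu_x[r]) for r in routes)
```

Two sanity checks follow from (1.5): $l(0\,\|\,\lambda,0)=\lambda$, the cost of cutting the arrivals on a jammed route, and $l(\lambda-\mu\,\|\,\lambda,\mu)=0$, since following the natural drift costs nothing.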
Let us explain briefly the meaning of the different terms appearing in $L(x,D)$ [see (1.4)]. Owing to the fact that the service rate $\mu_{ij}(x)$ tends to 0 when $x_{ij}$ becomes null, while $x_i\vee x_j$ remains strictly positive, the arrivals must be cut on the routes $ij\in\Lambda_1(x)$ in order to keep these routes in a neighborhood of 0. The cost to do this is $\sum_{ij\in\Lambda_1(x)}\lambda_{ij}$: indeed, $l(0\,\|\,\lambda_{ij},0)=\lambda_{ij}$. Since the arrivals are cut on the routes $ij\in\Lambda_1(x)$, the routes $ij\in\Lambda_2(x)$ are isolated from the rest of the network (see Figure 2) and so by (1.1) this set of routes behaves as an ergodic star network [with $\mathcal{R}=\Lambda_2(x)$] since $Q$ is ergodic by assumption. Hence, the cost for the components $ij\in\Lambda_2(x)$ to stay in a neighborhood of 0 is null. Now locally, the routes $ij\in\Lambda(x)$ behave as a set of independent M/M/1 queues with arrival and service rates $\lambda_{ij}$ and $\mu_{ij}(x)$. The corresponding terms in $L(x,D)$ represent the cost that this set of queues follows the prescribed drift $D$.

The proof is done by introducing a functional, the so-called empirical generator, consisting of $Q_t$ and of the joint number of arrivals on routes belonging to $\Lambda(x)\cup\Lambda_1(x)$. In Section 4 large deviation bounds are obtained for the localized empirical generator, from which Theorem 1.2 is derived using an adaptation of the contraction principle.

Theorem 1.2 has been stated for ergodic networks. However, large deviations bounds can be obtained for transient networks, at the cost of some more detailed analysis. This is an important feature since it is linked with the study of networks under max-min-fair allocation (or similar ones). The reason is that, for an ergodic network under max-min-fair allocation, when some routes $ij\in\Lambda$ are made saturated (i.e., for localized models), the rest of the routes (in our notation $\Lambda^c$) can behave as a transient network, still under max-min-fair allocation: the local rate function must include the cost for this transient network to stay near 0. This is in contrast to our framework, where only ergodic networks are considered, for which the cost to stay around 0 is null. However, our methodology allows one to compute the cost for a transient network under the min policy to stay around 0 and, as a corollary, the rate function without ergodicity assumptions [see (6.6)]. The result is stated and discussed in Section 6.

Moreover, the topology of the network can be extended, as well as the length of the routes (but not arbitrarily), to include more realistic networks. However, the notation becomes very heavy and our aim is to present tools [extending those developed for polling networks in Delcoigne and de La Fortelle (2002)] in a fairly simple way for achieving the above program.

Now, the rate function $I_T(\cdot)$ for the sample path LDP is expressed as
\[
I_T(\varphi) \stackrel{\text{def}}{=}
\begin{cases}
\displaystyle\int_0^T L\bigl(\varphi(t),\dot\varphi(t)\bigr)\,dt, & \text{if } \varphi \text{ is absolutely continuous},\\[6pt]
+\infty, & \text{otherwise.}
\end{cases}
\tag{1.6}
\]
Remark. $I_T(\cdot)$ is defined by all the values $L(x,D)$ with $x\in\mathbb{R}_+^{\mathcal R}$ and $D\in\mathbb{R}^{\Lambda(x)}$ (i.e., the values treated by Theorem 1.2). Indeed, assume that for some $t$, $\varphi_{ij}(t)=0$ and $\dot\varphi_{ij}(t)$ exists. Since $\varphi_{ij}(t)\le\varphi_{ij}(s)$ for all $s$, the function $\varphi_{ij}$ is minimal at $t$ and then, necessarily, $\dot\varphi_{ij}(t)=0$. Moreover, $\varphi$ being absolutely continuous, $\dot\varphi_{ij}(t)$ exists for almost all $t$.

Define the level set
\[
\Phi_x(K)=\bigl\{\varphi\in D([0,T],\mathbb{R}_+^{\mathcal R}): I_T(\varphi)\le K,\ \varphi(0)=x\bigr\}. \tag{1.7}
\]

The final result is the following theorem. 

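Theorem 1.3 (Sample path LDP). Assume $Q$ is ergodic. The sequence $\{Q^n_x,\ n\ge1\}$ satisfies an LDP in $D([0,T],\mathbb{R}_+^{\mathcal R})$ with good rate function $I_T(\cdot)$: for every $T>0$, $x\in\mathbb{R}_+^{\mathcal R}$,

(i) for $C\subset\mathbb{R}_+^{\mathcal R}$ compact, $\bigcup_{x\in C}\Phi_x(K)$ is compact in $C([0,T],\mathbb{R}_+^{\mathcal R})$;

(ii) for each closed set $F$ of $D([0,T],\mathbb{R}_+^{\mathcal R})$,
\[
\limsup_{n\to\infty}\frac1n\log\mathbb{P}[Q^n_x\in F]\le-\inf\{I_T(\varphi):\ \varphi\in F,\ \varphi(0)=x\};
\]

(iii) for each open set $O$ of $D([0,T],\mathbb{R}_+^{\mathcal R})$,
\[
\liminf_{n\to\infty}\frac1n\log\mathbb{P}[Q^n_x\in O]\ge-\inf\{I_T(\varphi):\ \varphi\in O,\ \varphi(0)=x\}.
\]

Its proof is discussed in Section 5.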

2. Example. We would like to emphasize what kind of further results we aim at deducing from the sample path large deviation principle. First, it seems that the optimal paths of large deviation cannot, in general, be calculated, but some special solutions may be, leading to explicit expressions for the asymptotics of the stationary distribution (which is not known). This is a performance criterion of practical value: bounds for buffer size could be optimized, or simulation accuracy (through importance sampling using the change of measures associated to optimal paths) could be improved.

The work of Freidlin and Wentzell (1984) suggests that the tail of the stationary distribution of the link $i$ is related to $I_T(\cdot)$ by the following formula:
\[
\lim_{n\to\infty}\frac1n\log\mathbb{P}[q_i\ge n]=-\inf_{T}\,\inf\{I_T(\varphi):\ \varphi(0)=0,\ \varphi_i(T)=1\}. \tag{2.1}
\]

Although technical, it is reasonable to argue that the preceding equality can be checked in our case. However, this leads to an infinite-dimensional optimization problem. Nonetheless, by comparison with a processor sharing model, it seems that we can have partial information. Indeed, if the optimal path leading to the saturation of a channel $i$ is such that this channel is always the bottleneck (i.e., $C_i/x_i$ is minimal) then the service rate can be written, for each connection,
\[
\mu_{ij}(x)=\mu_{ij}\,x_{ij}\,\frac{C_i}{x_i}.
\]
This is exactly the service rate for a processor sharing model, which is a well-known model [e.g., the stationary distribution is explicit; see Baskett, Chandy, Muntz and Palacios (1975)]. With some calculations we can find a necessary condition for a channel $i$ to behave like a processor sharing queue (having thus the same stationary distribution decay rate). Otherwise there are more complex interactions between channels.

Fig. 4. The network studied as example. The capacities of the respective channels are $C_1=3$, $C_2=2$ and $C_3=1$. The arrival rates and document size parameters are $\lambda_{12}=\mu_{12}=1$, $\lambda_{23}=1$, $\mu_{23}=2$ and $\lambda_{13}=x$, $\mu_{13}=1$. [Figure labels: channel 1, channel 2, channel 3.]

To illustrate how the behavior of the channels changes with the load, we estimated the decay rates of the queues for different arrival rates $\lambda_{13}=x$. We simulated the network described in Figure 4 and obtained statistics for the decay rate of the stationary queue length distribution $\mathbb{P}[q_i=n]$. In Figure 5 these results are compared with the decay rates of processor sharing models with the same parameters as the channels.

The necessary condition mentioned above states that queue 1 can never behave like a processor sharing model, queue 2 can only if $x<0.292893$, and queue 3 always can. This is, indeed, what we see in Figure 5. All plain lines are lower bounds and sometimes fit the simulation results well. Queue 1 is "pushed" by 2 and then by 3; queue 2 is on its own (i.e., behaves like a processor sharing queue) until approximately $x=0.2$, then is pushed by 3; queue 3 is always on its own. We see that the necessary condition $x<0.292893$ holds, but is not very tight. We hope this kind of study can furnish more detailed results and holds for other policies.



STAR NETWORKS LDP: THE MIN POLICY 
3. Localized empirical generator, entropy and the rate function. 



3.1. Localized empirical generator. Take $x\in\mathbb{R}_+^{\mathcal R}$ and $D\in\mathbb{R}^{\Lambda(x)}$. We are interested in computing large deviations bounds of the form (1.3) (i.e., linear bounds as presented in Figure 3). In order to prove Theorem 1.2 we introduce a functional which allows one to measure how the different arrival rates should be modified in order that the rescaled process $Q^n$ follows a prescribed drift $D$. Moreover, the explanation given just after the statement of Theorem 1.2 suggests that the transition rates of routes indexed by $\Lambda_2(x)$ should not be modified and so it is useless to measure the arrivals on routes belonging to $\Lambda_2(x)$. Let us introduce the localized empirical generator at point $x$, as well as suitable state spaces associated to this process:



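Definition 3.1 (Localized empirical generators). Let $\Lambda$ be a face and denote:

• $A_{ij}(t)$, the number of arrivals on route $ij$ until time $t$;

• the restriction $A^{\Lambda}(t)=\bigl(A_{ij}(t),\ ij\in\Lambda\cup\Lambda_1\bigr)$;

• $G^{\Lambda}_t=\Bigl(\dfrac1t A^{\Lambda}(t),\ \dfrac1t\bigl(Q(t)-Q(0)\bigr)\Bigr)$, the localized empirical generator on the face $\Lambda$.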



queue length decay rate for a 3 branches star network 




Fig. 5. Comparison between decay rates obtained by simulation to processor sharing mod- 
els with the equivalent parameters. Simulations were stopped at time T = 10 . 
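The set $\Gamma^{\Lambda}$ of localized empirical generators is the set of elements $(A,D)$, $A=(a_{ij},\ ij\in\Lambda\cup\Lambda_1)$, with $D\in\mathbb{R}^{\Lambda}$ satisfying:
\[
\begin{aligned}
&\text{(i)}\quad a_{ij}\ge0 \qquad \forall ij\in\Lambda\cup\Lambda_1,\\
&\text{(ii)}\quad a_{ij}-D_{ij}\ge0 \qquad \forall ij\in\Lambda\cup\Lambda_1.
\end{aligned}
\tag{3.1}
\]
The space $\Gamma^{\Lambda}$ is equipped with the distance $d$ defined by
\[
d(G,G') \stackrel{\text{def}}{=} \sum_{ij\in\Lambda\cup\Lambda_1}|a_{ij}-a'_{ij}|+\sum_{ij\in\Lambda}|D_{ij}-D'_{ij}| \qquad \forall G,G'\in\Gamma^{\Lambda}.
\]
The inequalities (i) and (ii) in (3.1) refer, respectively, to the mean number of arrivals $a_{ij}$ and to the mean number of disconnections per unit time, $a_{ij}-D_{ij}$, both being nonnegative.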

Since it is difficult to analyze directly the behavior of $Q(t)$ as in (1.3), we shall first establish large deviation bounds for the event
\[
E^{(n)}_{\tau,\delta,y}(x,G) \stackrel{\text{def}}{=} \Bigl\{G^{\Lambda(x)}_{n\tau}\in B(G,\delta),\ \sup_{t\in[0,n\tau]}|Q(t,y)-nx-Dt|<\delta n\Bigr\}, \tag{3.2}
\]
where $B(G,\delta)$ is the ball of center $G$ and radius $\delta$ [within the metric space $(\Gamma^{\Lambda(x)},d)$]. As will emerge, strong constraints must be imposed on $G$ in order that the event $E^{(n)}_{\tau,\delta,y}(x,G)$ occurs at a large deviation scale. More precisely, the arrivals must be cut on routes belonging to $\Lambda_1(x)$.

Lemma 3.2. Take $x\in\mathbb{R}_+^{\mathcal R}$ and $G=(A,D)\in\Gamma^{\Lambda(x)}$, such that $D\in\mathbb{R}^{\Lambda(x)}$. If there exist $m$ and $p$ such that
\[
x_m=0 \quad\text{and}\quad x_p>0, \quad\text{and}\quad a_{pm}>0,
\]
then $E^{(n)}_{\tau,\delta,y}(x,G)$ almost never occurs at a large deviation scale, that is,
\[
\lim_{\tau,\delta,\varepsilon\to0}\ \limsup_{n\to\infty}\ \frac{1}{n\tau}\sup_{|y-nx|<\varepsilon n}\log\mathbb{P}\bigl[E^{(n)}_{\tau,\delta,y}(x,G)\bigr]=-\infty. \tag{3.3}
\]

Proof. The proof relies on a change of measure, as in Section 4.1. In fact, on $E^{(n)}_{\tau,\delta,y}(x,G)$ the service rate on route $pm$ tends to 0 when the different limits are taken. Since on $E^{(n)}_{\tau,\delta,y}(x,G)$ the arrival process is not cut on route $pm$, the cost to keep the component $pm$ of the rescaled process near 0 is infinite. Details are similar to the proof of the upper bound (see Section 4.2) and are omitted. $\square$

Lemma 3.2 states that in order to prove large deviation bounds for the localized empirical generator, it will be sufficient to deal with the following subspace of $\Gamma^{\Lambda(x)}$.

Definition 3.3. $\mathcal{G}^{\Lambda}$ denotes the set of localized empirical generators $(A,D)$ such that:
\[
\begin{aligned}
&\text{(i)}\quad D\in\mathbb{R}^{\Lambda},\\
&\text{(ii)}\quad a_{ij}=0 \qquad \forall ij\in\Lambda_1,\\
&\text{(iii)}\quad a_{ij}>0 \ \text{ and }\ a_{ij}-D_{ij}>0 \qquad \forall ij\in\Lambda.
\end{aligned}
\tag{3.4}
\]
In this setting $\bar{\mathcal{G}}^{\Lambda}$ will represent the closure of $\mathcal{G}^{\Lambda}$.

Owing to Lemma 3.2, it is sufficient to deal with empirical generators satisfying (ii). In order to prove the large deviation local bounds, it will be sufficient to deal with empirical generators such that arrival and service rates are not cut for $ij\in\Lambda(x)$, hence condition (iii). A simple continuity argument will allow us to extend the bounds obtained for $G\in\mathcal{G}^{\Lambda(x)}$ to $G\in\bar{\mathcal{G}}^{\Lambda(x)}$.


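3.2. Correspondence between localized empirical generators and star networks. Let $G=(A,D)\in\bar{\mathcal{G}}^{\Lambda}$ be a localized empirical generator. It is associated with a unique localized star network $(\tilde\lambda_{ij},\tilde\mu_{ij}(y),\ y\in\mathbb{R}_+^{\mathcal R})$ by the following relations:
\[
\begin{aligned}
&\text{(i)}\quad \tilde\lambda_{ij}=a_{ij} \qquad \forall ij\in\Lambda\cup\Lambda_1,\\
&\text{(ii)}\quad \tilde\lambda_{ij}=\lambda_{ij} \qquad \forall ij\in\Lambda_2,\\
&\text{(iii)}\quad \tilde\mu_{ij}(y)=\tilde\lambda_{ij}-D_{ij} \qquad \forall ij\in\Lambda\cup\Lambda_1,\ \forall y\in\mathbb{R}_+^{\mathcal R},\\
&\text{(iv)}\quad \tilde\mu_{ij}(y)=\mu_{ij}(y) \qquad \forall ij\in\Lambda_2,\ \forall y\in\mathbb{R}_+^{\mathcal R}.
\end{aligned}
\tag{3.5}
\]
Let us describe the behavior of this network when it starts from $x$ [with $\Lambda=\Lambda(x)$]. In this case the routes belonging to $\Lambda_2$ behave as a star network of the type presently studied and the parameters of the routes belonging to this set are left unchanged. Moreover, they are independent from the rest of the network. Indeed, if $ij\in\Lambda_2$, then $x_{ik}=0$ for all $k$ such that $ik\notin\Lambda_2$ (actually $ik\in\Lambda_1$; see Figure 2), hence the constraints imposed on $G$ ensure that $\tilde\lambda_{ik}=0$. Hence,
\[
\tilde\mu_{ij}(Q(s))=\mu_{ij}\,Q_{ij}(s)\left(\frac{C_i}{\sum_{k:\,ik\in\Lambda_2}Q_{ik}(s)}\wedge\frac{C_j}{\sum_{k:\,jk\in\Lambda_2}Q_{jk}(s)}\right) \qquad \forall ij\in\Lambda_2(x),
\]
proving the asserted independence. Moreover, the network consisting of the routes belonging to $\Lambda_2$ is ergodic when the initial network is. Indeed, for every ergodic channel $i$ (see Figure 2),
\[
\sum_{j:\,ij\in\Lambda_2}\frac{\tilde\lambda_{ij}}{\mu_{ij}}=\sum_{j:\,ij\in\Lambda_2}\frac{\lambda_{ij}}{\mu_{ij}}\le\sum_{j\in S}\frac{\lambda_{ij}}{\mu_{ij}}<C_i.
\]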
Besides, routes belonging to $\Lambda$ behave like independent M/M/1 queues up to the initial conditions, whereas the routes indexed by $\Lambda_1$ remain null. Now, the parameters have been chosen so that:

Lemma 3.4. Assume that $Q$ is ergodic. Let $x\in\mathbb{R}_+^{\mathcal R}$, let $G=(A,D)\in\mathcal{G}^{\Lambda(x)}$ be a localized empirical generator, and denote by $\tilde{\mathbb{P}}$ the law of its associated star network. Then, for all $\tau$,
\[
\lim_{\delta,\varepsilon\to0}\ \inf_{|y-nx|<\varepsilon n}\ \liminf_{n\to\infty}\ \tilde{\mathbb{P}}\Bigl[E^{(n)}_{\tau,\delta,y}(x,G)\cap\{A_{ij}(n\tau)=0\ \ \forall ij\in\Lambda_1(x)\}\Bigr]=1.
\]

Proof. The proof is omitted: it is a classical fluid limit. $\square$

3.3. Entropy. 

Definition 3.5. Let $x\in\mathbb{R}_+^{\mathcal R}$, let $R(x)=(\lambda_{ij},\mu_{ij}(x))$ denote the generator of the star network at $x$, let $G=(A,D)\in\bar{\mathcal{G}}^{\Lambda(x)}$ be a localized generator and $(\tilde\lambda_{ij},\tilde\mu_{ij}(y),\ y\in\mathbb{R}_+^{\mathcal R})$ its representation as a star network. The relative entropy of $G$ with respect to $R(x)$ is
\[
H(G\,\|\,R(x))=\sum_{ij\in\Lambda(x)\cup\Lambda_1(x)} I_p(\tilde\lambda_{ij}\,\|\,\lambda_{ij})+I_p(\tilde\mu_{ij}\,\|\,\mu_{ij}(x)),
\]
where $I_p(\nu\,\|\,\lambda)$ is the relative entropy of Poisson processes of intensities $\nu$ and $\lambda$ defined by
\[
I_p(\nu\,\|\,\lambda) \stackrel{\text{def}}{=} \nu\log\frac{\nu}{\lambda}-\nu+\lambda, \tag{3.6}
\]
with the convention $\frac00=0$ and $0\log0=0$.

The entropy has an easy interpretation in terms of information theory: it can be defined as the mean information gain. $H(\cdot\,\|\,R)$ is decomposed as the sum of the information gain for the arrivals, $I_p(\tilde\lambda_{ij}\,\|\,\lambda_{ij})$, and the information gain for the service times, $I_p(\tilde\mu_{ij}\,\|\,\mu_{ij}(x))$.

Lemma 3.6. For fixed $x$, $H(\cdot\,\|\,R(x))$ is continuous on $\bar{\mathcal{G}}^{\Lambda(x)}$.

Proof. It is an easy consequence of the expression (3.6). $\square$

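3.4. The local rate function $L(x,D)$.

Definition 3.7. The local rate function $L(x,D)$ is defined by
\[
L(x,D)=\inf_{G\in f_{\Lambda(x)}^{-1}(D)} H(G\,\|\,R(x)) \qquad \forall D\in\mathbb{R}^{\Lambda(x)}, \tag{3.7}
\]
where $f_{\Lambda(x)}:\bar{\mathcal{G}}^{\Lambda(x)}\to\mathbb{R}^{\Lambda(x)}$ is the projection $f_{\Lambda(x)}(G)=D$.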
It appears that $L(x,D)$ is the cost for a set of independent M/M/1 queues indexed by $\Lambda(x)\cup\Lambda_1(x)$ to follow the prescribed drift $D$ when the queues are far from all boundaries. A simple computation yields
\[
l(D\,\|\,\lambda,\mu)=D\log\left(\frac{D+\sqrt{D^2+4\lambda\mu}}{2\lambda}\right)+\lambda+\mu-\sqrt{D^2+4\lambda\mu}
\]
for the cost that an M/M/1 queue with parameters $\lambda$ and $\mu$ follows the drift $D$ [see, e.g., Shwartz and Weiss (1995)]. Using this remark and the identity $l(0\,\|\,\lambda,0)=\lambda$, one can deduce the explicit representation (1.4) for $L(x,D)$ [which is equal to (3.8) under the constraint $D_{ij}=\mu_{ij}(x)=0$ for $ij\in\Lambda_1(x)$].

In equations (1.4) and (3.7), $L(x,D)$ is only defined for $D\in\mathbb{R}^{\Lambda(x)}$. In order to study the properties of the rate function $I_T(\cdot)$, it is convenient to extend the definition of $L(x,D)$ to all $D$ such that $D_{ij}\ge0$ for all $ij\in\Lambda^c(x)$ by
\[
L(x,D) \stackrel{\text{def}}{=} \sum_{ij\in\Lambda(x)\cup\Lambda_1(x)} l\bigl(D_{ij}\,\|\,\lambda_{ij},\mu_{ij}(x)\bigr). \tag{3.8}
\]
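The "simple computation" can be checked numerically for a single queue: minimizing $I_p(a\,\|\,\lambda)+I_p(a-D\,\|\,\mu)$ over the arrival rate $a$, with the service rate constrained to $a-D$, should reproduce the closed form. The sketch below is a sanity check written under stated assumptions: the function names are hypothetical, $\lambda,\mu>0$ is assumed, and a crude grid search stands in for the exact minimization (whose minimizer satisfies $a(a-D)=\lambda\mu$).

```python
import math


def poisson_entropy(nu: float, lam: float) -> float:
    """I_p(nu || lam) = nu log(nu / lam) - nu + lam, with 0 log 0 = 0."""
    if nu == 0.0:
        return lam
    return nu * math.log(nu / lam) - nu + lam


def closed_form(d: float, lam: float, mu: float) -> float:
    """Closed-form cost l(D || lam, mu), assuming lam > 0 and mu > 0."""
    root = math.sqrt(d * d + 4.0 * lam * mu)
    return d * math.log((d + root) / (2.0 * lam)) + lam + mu - root


def grid_infimum(d: float, lam: float, mu: float,
                 step: float = 1e-3, span: float = 10.0) -> float:
    """Minimise I_p(a || lam) + I_p(a - d || mu) over a > max(d, 0) on a grid."""
    start = max(d, 0.0)
    best = math.inf
    for k in range(1, int(span / step) + 1):
        a = start + k * step
        best = min(best, poisson_entropy(a, lam) + poisson_entropy(a - d, mu))
    return best
```

The grid value can only overestimate the infimum, and with the default step the gap to the closed form is far below the fourth decimal for moderate parameters.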

Proposition 3.8. The local rate function $L(x,D)$ possesses the following properties:

(i) it is positive, finite, strictly convex and continuous with respect to $D$ such that $D_{ij}\ge0$ for all $ij\in\Lambda^c(x)$, and it has compact level sets;
(ii) there exists $M\in\mathbb{R}$ such that

L{x,D)>^\\D\\log\\D\\ VxGMJ, V||L»||>M; 

(iii) for a fixed $D$ and a prescribed face $\Lambda$, $L(x,D)$ is continuous for $x\in\Lambda$ [see (1.2)];

(iv) $L(x,D)$ is jointly lower semicontinuous w.r.t. $x$ and $D$.

Proof. Properties (i) and (ii) are obvious from (3.8).

(iii) is clear from (3.8), noting that the functions $\mu_{ij}(x)$, $ij\in\Lambda$, are continuous for $x$ belonging to the face $\Lambda$. Moreover, $\Lambda_1(x)=\Lambda_1$ is constant for $x\in\Lambda$.

Let $(x^{(n)},D^{(n)})$ tend to $(x,D)$. First, it is clear that for $n$ large enough, $\Lambda(x)\subset\Lambda(x^{(n)})$ and also $\Lambda(x)\cup\Lambda_1(x)\subset\Lambda(x^{(n)})\cup\Lambda_1(x^{(n)})$. Hence, since $l$ is positive, for sufficiently large $n$,
\[
L(x^{(n)},D^{(n)})\ge\sum_{ij\in\Lambda(x)\cup\Lambda_1(x)} l\bigl(D^{(n)}_{ij}\,\|\,\lambda_{ij},\mu_{ij}(x^{(n)})\bigr). \tag{3.9}
\]
Now, $\lambda_{ij}>0$ (since $ij\in\mathcal{R}$) so that $l(\cdot\,\|\,\lambda_{ij},\cdot)$ is continuous. Moreover, $\mu_{ij}(x^{(n)})\to\mu_{ij}(x)$ for all $ij\in\Lambda(x)\cup\Lambda_1(x)$. Therefore the right-hand side of (3.9) converges to $L(x,D)$ and the lower semicontinuity (iv) is proved. $\square$
3.5. The sample path rate function $I_T(\cdot)$.

Proposition 3.9. The rate function $I_T(\cdot)$ defined in (1.6) possesses the following properties:

(i) Assume $I_T(\varphi)\le K$ for some $K$. Then, for all $\varepsilon>0$, there exists $\delta>0$ independent of $\varphi$, such that for any collection of nonoverlapping intervals $[t_j,t_{j+1}]$ in $[0,T]$ with $\sum_j (t_{j+1}-t_j)=\delta$,
\[
\sum_j|\varphi(t_{j+1})-\varphi(t_j)|<\varepsilon;
\]

(ii) $I_T(\cdot)$ is lower semicontinuous in $\bigl(D([0,T],\mathbb{R}_+^{\mathcal R}),d_d\bigr)$;

(iii) for $C\subset\mathbb{R}_+^{\mathcal R}$ compact, $\bigcup_{x\in C}\Phi_x(K)$ is compact in $C([0,T],\mathbb{R}_+^{\mathcal R})$ [see (1.7) for the definition of the level set $\Phi_x(K)$];

(iv) consider an absolutely continuous function $\varphi$ with $I_T(\varphi)<\infty$. Then, for all $\varepsilon>0$, there exists a piecewise linear function $\varphi_\varepsilon$ such that:

(a) $d_c(\varphi_\varepsilon,\varphi)<\varepsilon$;

(b) $I_T(\varphi_\varepsilon)\le I_T(\varphi)+\varepsilon$.

Proof. One proves (i) using Proposition 3.8(ii) in a way similar to Lemma 5.18 of Shwartz and Weiss (1995).

In order to prove the lower semicontinuity of $I_T(\cdot)$, (i) shows it is sufficient to consider sequences of absolutely continuous functions. Since on $C([0,T],\mathbb{R}_+^{\mathcal R})$ the metrics $d_c$ and $d_d$ are equivalent, one can use $d_c$. Now, using Proposition 3.8(ii) and the fact that $L(x,D)$ is lower semicontinuous in $(x,D)$ and convex with respect to $D$ by Proposition 3.8, (ii) is proved by means of Theorem 3 of Section 9.1.4 in Ioffe and Tihomirov (1979).

(iii) is a consequence of (i) and (ii) [see Proposition 5.46 of Shwartz and Weiss (1995)].

The proof of (iv) is a simple adaptation of Proposition 6.3(4) of Delcoigne and de La Fortelle (2002). $\square$

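4. Large deviations bounds for the localized empirical generator. In this section, we aim at proving the following theorem:

Theorem 4.1. Let $x\in\mathbb{R}_+^{\mathcal R}$ and let $G=(A,D)\in\bar{\mathcal{G}}^{\Lambda(x)}$ be a localized generator. Then
\[
\begin{aligned}
-H(G\,\|\,R(x))&=\lim_{\tau,\delta,\varepsilon\to0}\ \inf_{|y-nx|<\varepsilon n}\ \liminf_{n\to\infty}\ \frac{1}{n\tau}\log\mathbb{P}\bigl[E^{(n)}_{\tau,\delta,y}(x,G)\bigr]\\
&=\lim_{\tau,\delta,\varepsilon\to0}\ \sup_{|y-nx|<\varepsilon n}\ \limsup_{n\to\infty}\ \frac{1}{n\tau}\log\mathbb{P}\bigl[E^{(n)}_{\tau,\delta,y}(x,G)\bigr],
\end{aligned}
\]
where $E^{(n)}_{\tau,\delta,y}(x,G)$ is the event defined in (3.2). Moreover, if a face $\Lambda$ and a drift $D\in\mathbb{R}^{\Lambda}$ are fixed, then the preceding limit in $\tau$ is uniform with respect to $x$ in compact sets of $\Lambda$ (see Definition 1.1).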

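4.1. An exponential change of measure. Fix an empirical generator $G=(A,D)=(\tilde\lambda_{ij},\tilde\mu_{ij}(y),\ y\in\mathbb{R}_+^{\mathcal R})\in\bar{\mathcal{G}}^{\Lambda}$ and denote by the following:

• $N_t$, the number of jumps of the process until time $t$.

• $Q(k)=\{Q_{ij}(k),\ i,j\in S\}$, the embedded Markov chain at time $k\in\mathbb{N}$. We shall distinguish between discrete and continuous time by using $k$ for discrete and $s$ or $t$ for continuous time.

Define the following:

• The mapping $h:\mathbb{Z}_+^{\mathcal R}\times\mathbb{Z}_+^{\mathcal R}\to\mathbb{R}$ by
\[
h(x,y) \stackrel{\text{def}}{=}
\begin{cases}
\log\dfrac{\tilde\lambda_{ij}}{\lambda_{ij}}, & \text{if } y-x=e_{ij} \text{ and } \lambda_{ij}>0,\\[8pt]
\log\dfrac{\tilde\mu_{ij}(x)}{\mu_{ij}(x)}, & \text{if } y-x=-e_{ij} \text{ and } \mu_{ij}(x)>0,\\[8pt]
0, & \text{otherwise.}
\end{cases}
\]

• The compensator $K:\mathbb{Z}_+^{\mathcal R}\to\mathbb{R}$ by
\[
K(x) \stackrel{\text{def}}{=} \sum_{ij\in\mathcal{R}}\bigl(\tilde\lambda_{ij}-\lambda_{ij}\bigr)+\sum_{ij\in\mathcal{R}}\bigl(\tilde\mu_{ij}(x)-\mu_{ij}(x)\bigr). \tag{4.1}
\]

• The process
\[
\mathcal{M}_t \stackrel{\text{def}}{=} \exp\Biggl\{\sum_{k=0}^{N_t-1}h\bigl(Q(k),Q(k+1)\bigr)-\int_0^t K(Q(s))\,ds\Biggr\}.
\]

Note that the compensator is always bounded, so that $\mathcal{M}_t$ takes only finite values. Since $K$ has been exactly defined so that
\[
K(x)=\frac{d}{dt}\,\mathbb{E}\Biggl[\exp\Biggl\{\sum_{k=0}^{N_t-1}h\bigl(Q(k,x),Q(k+1,x)\bigr)\Biggr\}\Biggr]\Bigg|_{t=0},
\]
it is easily checked that the derivative of $\mathbb{E}[\mathcal{M}_t]$ at $t=0$ is null (note that the derivative is independent of $\Lambda$, so that it is dropped). Then using the Markov property, one can get that the derivative is null for all $t\ge0$, so that
\[
\mathbb{E}[\mathcal{M}_t]=1.
\]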
Using again the Markov property, this proves that
\[
\mathbb{E}[\mathcal{M}_t\,|\,\mathcal{F}_s]=\mathcal{M}_s \qquad \text{for all } t\ge s\ge0,
\]
hence, $\{\mathcal{M}_t,\ t\ge0\}$ is a martingale w.r.t. the natural filtration $\mathcal{F}_t$. Then define a new probability measure by
\[
\tilde{\mathbb{P}}[B] \stackrel{\text{def}}{=} \mathbb{E}\bigl[1_{\{B\}}\mathcal{M}_t\bigr] \qquad \forall B\in\mathcal{F}_t.
\]
It is a matter of routine to show that under $\tilde{\mathbb{P}}$, $Q$ is again a Markov process. In fact, under $\tilde{\mathbb{P}}$, the system behaves like a star network, where the arrival and the service rates on route $ij$ are respectively given by $\tilde\lambda_{ij}$ and $\tilde\mu_{ij}(y)$ (whence the notation).

Remark. The probability measure $\mathbb{P}$ is not necessarily absolutely continuous with respect to $\tilde{\mathbb{P}}$. This is the case, for instance, if for some $ij\in\mathcal{R}$, $\tilde\lambda_{ij}=0$ (whereas $\lambda_{ij}>0$).
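The role of the compensator in Section 4.1 can be illustrated at a single state: it is the quantity $\sum_y q(x,y)\,(e^{h(x,y)}-1)$, which reduces to the difference between the twisted and the original rates over the transitions that are possible under the original law. The sketch below is an illustration only. The function names and the labelling of transitions are assumptions, and it follows the reading of $h$ in which the ratio of rates is used whenever the original rate is positive and $h=0$ otherwise.

```python
from typing import Dict, Hashable


def tilt(old: float, new: float) -> float:
    """exp(h) for one transition: new / old if the original rate is positive, else 1."""
    return new / old if old > 0.0 else 1.0


def compensator(rates: Dict[Hashable, float], twisted: Dict[Hashable, float]) -> float:
    """K(x) = sum over transitions y of q(x, y) * (exp(h(x, y)) - 1)."""
    return sum(q * (tilt(q, twisted[y]) - 1.0) for y, q in rates.items())


# Illustrative rates at one state: arrivals and departures on two routes.
rates = {("arrival", (1, 2)): 1.0, ("departure", (1, 2)): 2.0,
         ("arrival", (1, 3)): 0.3, ("departure", (1, 3)): 0.0}
twisted = {("arrival", (1, 2)): 1.5, ("departure", (1, 2)): 1.0,
           ("arrival", (1, 3)): 0.0, ("departure", (1, 3)): 0.0}
k_value = compensator(rates, twisted)
```

Here the arrivals on route 13 are cut (twisted rate 0 with original rate 0.3), which is exactly the situation described in the remark above: the twisted law gives no mass to paths with such arrivals, while the original law does.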

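4.2. Proof of the upper bound of Theorem 4.1. Since $\mathbb{P}$ is not necessarily absolutely continuous with respect to $\tilde{\mathbb{P}}$, in order to prove the upper bound, we introduce a sequence of changes of measure $\{\tilde{\mathbb{P}}^{(\eta)},\ \eta>0\}$ such that
\[
\tilde\lambda^{(\eta)}_{ij}>0 \quad\text{and}\quad \lim_{\eta\to0}\tilde\lambda^{(\eta)}_{ij}=\tilde\lambda_{ij} \qquad \forall ij\in\Lambda(x)\cup\Lambda_1(x),
\]
\[
\tilde\mu^{(\eta)}_{ij}>0 \quad\text{and}\quad \lim_{\eta\to0}\tilde\mu^{(\eta)}_{ij}=\tilde\mu_{ij}(x) \qquad \forall ij\in\Lambda(x).
\]
In this setting, $\{\mathcal{M}^{(\eta)}_t,\ t\ge0\}$ is the martingale defining $\tilde{\mathbb{P}}^{(\eta)}$ with respect to $\mathbb{P}$, and $h^{(\eta)}(x,y)$ and $K^{(\eta)}(x)$ are the functions used to define $\mathcal{M}^{(\eta)}_t$ according to Section 4.1. Now, $\tilde{\mathbb{P}}^{(\eta)}$ and $\mathbb{P}$ are mutually absolutely continuous and
\[
\mathbb{P}\bigl[E^{(n)}_{\tau,\delta,y}(x,G)\bigr]=\tilde{\mathbb{E}}^{(\eta)}\Bigl[1_{\{E^{(n)}_{\tau,\delta,y}(x,G)\}}\bigl(\mathcal{M}^{(\eta)}_{n\tau}\bigr)^{-1}\Bigr]. \tag{4.2}
\]
Let us majorize $(\mathcal{M}^{(\eta)}_{n\tau})^{-1}$ on $E^{(n)}_{\tau,\delta,y}(x,G)$ when $|y-nx|<\varepsilon n$. First, recalling
\[
\tilde\lambda_{ij}=\lambda_{ij} \qquad \text{for } ij\in\Lambda_2
\]
and
\[
\tilde\mu_{ij}(y)=\mu_{ij}(y) \qquad \text{for } ij\in\Lambda_1\cup\Lambda_2 \text{ and } y\in\mathbb{R}_+^{\mathcal R},
\]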
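one has the following bounds:
\[
\begin{aligned}
-\sum_{k=0}^{N_{n\tau}-1}h^{(\eta)}\bigl(Q(k),Q(k+1)\bigr)
\le{}&-n\tau\Biggl(\sum_{ij\in\Lambda(x)}\tilde\mu_{ij}\log\frac{\tilde\mu^{(\eta)}_{ij}}{\sup_{s\in[0,n\tau]}\mu_{ij}(Q(s))}
+\sum_{ij\in\Lambda(x)\cup\Lambda_1(x)}\tilde\lambda_{ij}\log\frac{\tilde\lambda^{(\eta)}_{ij}}{\lambda_{ij}}\Biggr)\\
&+n\tau\delta\Biggl(\sum_{ij\in\Lambda(x)}\Biggl|\log\frac{\tilde\mu^{(\eta)}_{ij}}{\inf_{s\in[0,n\tau]}\mu_{ij}(Q(s))}\Biggr|
+\sum_{ij\in\Lambda(x)\cup\Lambda_1(x)}\Biggl|\log\frac{\tilde\lambda^{(\eta)}_{ij}}{\lambda_{ij}}\Biggr|\Biggr).
\end{aligned}
\tag{4.3}
\]
Moreover, the compensator $K^{(\eta)}$ defined as in (4.1) is bounded by
\[
\int_0^{n\tau}K^{(\eta)}(Q(s))\,ds\le n\tau\sum_{ij\in\Lambda(x)\cup\Lambda_1(x)}\bigl(\tilde\lambda^{(\eta)}_{ij}-\lambda_{ij}\bigr)+n\tau\sum_{ij\in\Lambda(x)}\Bigl(\tilde\mu^{(\eta)}_{ij}-\inf_{s\in[0,n\tau]}\mu_{ij}(Q(s))\Bigr). \tag{4.4}
\]
Besides, on $E^{(n)}_{\tau,\delta,y}(x,G)$, we have for $ij\in\Lambda(x)$
\[
\begin{aligned}
0<\mu_{ij}(x)&=\lim_{\tau,\delta,\varepsilon\to0}\ \inf_{|y-nx|<\varepsilon n}\ \liminf_{n\to\infty}\ \inf_{s\in[0,n\tau]}\mu_{ij}(Q(s,y))\\
&=\lim_{\tau,\delta,\varepsilon\to0}\ \sup_{|y-nx|<\varepsilon n}\ \limsup_{n\to\infty}\ \sup_{s\in[0,n\tau]}\mu_{ij}(Q(s,y)).
\end{aligned}
\tag{4.5}
\]
Finally, majorizing $1_{\{E^{(n)}_{\tau,\delta,y}(x,G)\}}$ by 1, bounding $(\mathcal{M}^{(\eta)}_{n\tau})^{-1}$ using (4.3), (4.4) and (4.5) and taking into account the order in which the different limits are taken, the representation formula (4.2) yields
\[
\begin{aligned}
\lim_{\tau,\delta,\varepsilon\to0}\ \limsup_{n\to\infty}\ \frac{1}{n\tau}\sup_{|y-nx|<\varepsilon n}\log\mathbb{P}\bigl[E^{(n)}_{\tau,\delta,y}(x,G)\bigr]
\le{}&-\sum_{ij\in\Lambda(x)\cup\Lambda_1(x)}\Biggl(\tilde\lambda_{ij}\log\frac{\tilde\lambda^{(\eta)}_{ij}}{\lambda_{ij}}-\tilde\lambda^{(\eta)}_{ij}+\lambda_{ij}\Biggr)\\
&-\sum_{ij\in\Lambda(x)}\Biggl(\tilde\mu_{ij}\log\frac{\tilde\mu^{(\eta)}_{ij}}{\mu_{ij}(x)}-\tilde\mu^{(\eta)}_{ij}+\mu_{ij}(x)\Biggr).
\end{aligned}
\]
The proof of the upper bound is concluded by letting $\eta$ tend to 0.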
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4.3. Proof of the lower bound of Theorem A.l . Take G G ^^^^ and denote 
the event (appearing in Lemma 3.4) 

4iyi^,G) 1^' Ei^^ix,G) n iMnr) = Vij G Ai(x)}. 

Although P is not absolutely continuous w.r.t. P, by definition of Q ^^' , 
Xij > and flij > Vij G A{x) so that P is absolutely continuous w.r.t. P on 

4iyi^^D) and 

r[Ei%{x,G)]>F[F^%{x,G)] 

> inf M-Hu^)nFSjx,G)]. 



jeF"- ' (x,D) 



By Lemma 3.4, ^[F^^yAx , G)] tends to 1. Therefore, reversing the inequali- 
ties obtained for the upper bound yields 

lim liminf— inf logFlE^i'') (x , G)] 

T,5,e^0 n-^oo riT \y-nx\<en ' '^ 

> - Y^ Xij log -^ - Xij + Xij 

ijeA{x)uAi(x) 'i 

ij&A{x) f^^i^^> 

This concludes the proof of the lower bound when G G Q'^^^' . 

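Consider $G \in \bar{\mathcal{G}}^{\Lambda(x)}$ and define $G^{(\varepsilon')}$ by $\tilde\lambda^{\varepsilon'}_{ij} = \tilde\lambda_{ij} + \varepsilon'$ and $\tilde\mu^{\varepsilon'}_{ij} = \tilde\mu_{ij} + \varepsilon'$
for $ij \in \Lambda(x)$; otherwise, the coefficients are the same. Then $G^{(\varepsilon')}$ belongs to
$\mathcal{G}^{\Lambda(x)}$ for $\varepsilon' > 0$, it converges to $G$ and its entropy converges to $H(G\|R(x))$
by Lemma 3.6. Moreover, the drifts $D = (\tilde\lambda_{ij} - \tilde\mu_{ij})$ and $D^{(\varepsilon')} = (\tilde\lambda^{\varepsilon'}_{ij} - \tilde\mu^{\varepsilon'}_{ij})$
are equal.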
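
The perturbation step can be checked numerically on a single route. The sketch below is only an illustration: it assumes that the Poisson entropy of (3.6) has the usual form $I_p(a\|b) = a\log(a/b) - a + b$ (with $I_p(0\|b) = b$) and that one route contributes an arrival term plus a departure term to the entropy. The helper names and the numerical rates are hypothetical.

```python
import math

def poisson_entropy(a, b):
    """Assumed form of I_p(a || b) = a*log(a/b) - a + b, with I_p(0 || b) = b."""
    if a == 0.0:
        return b
    return a * math.log(a / b) - a + b

def route_entropy(lam_t, mu_t, lam, mu):
    """Entropy contribution of one route: arrival term plus departure term (assumed)."""
    return poisson_entropy(lam_t, lam) + poisson_entropy(mu_t, mu)

def perturb(lam_t, mu_t, eps):
    """Shift both modified rates by eps, so that their difference (the drift) is unchanged."""
    return lam_t + eps, mu_t + eps

# Hypothetical boundary generator: the modified departure rate vanishes.
lam, mu = 1.0, 2.0        # original rates of the route
lam_t, mu_t = 0.5, 0.0    # modified rates, drift 0.5

for eps in (1e-1, 1e-3, 1e-6):
    lam_e, mu_e = perturb(lam_t, mu_t, eps)
    drift = lam_e - mu_e
    gap = route_entropy(lam_e, mu_e, lam, mu) - route_entropy(lam_t, mu_t, lam, mu)
    print(f"eps={eps:g}  drift={drift:.6f}  entropy gap={gap:.3e}")
```

In this example the perturbed rates are strictly positive for every eps > 0, the drift stays at 0.5, and the entropy gap shrinks with eps, which is the behaviour the argument relies on.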

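For any $\varepsilon_0 > 0$ there exist $\varepsilon_1 > 0$ and $\delta_1 > 0$ such that, for all $0 < \varepsilon' < \varepsilon_1$
and $0 < \delta' < \delta_1$, $B(G^{(\varepsilon')},\delta') \subset B(G,\delta)$ and $H(G^{(\varepsilon')}\|R(x)) < H(G\|R(x)) + \varepsilon_0$.
For the sake of simplicity, we shall denote $G^{(\varepsilon')}$ by $G'$. Since $D' = D$, we get
the time uniform inclusion

$$
E^{(n)}_{\tau,\delta',y}(x,G') \subset E^{(n)}_{\tau,\delta,y}(x,G) \qquad \forall \tau > 0.
$$

It yields, using the decrease of $E^{(n)}_{\tau,\delta',y}(x,G')$ with $\delta'$,

$$
\begin{aligned}
\lim_{\varepsilon\to 0}\ \liminf_{n\to\infty}\ \frac{1}{n\tau} \inf_{|y-nx|<\varepsilon n} \log \mathbb{P}\bigl[E^{(n)}_{\tau,\delta,y}(x,G)\bigr]
&\ge \lim_{\varepsilon\to 0}\ \liminf_{n\to\infty}\ \frac{1}{n\tau} \inf_{|y-nx|<\varepsilon n} \log \mathbb{P}\bigl[E^{(n)}_{\tau,\delta',y}(x,G')\bigr]\\
&\ge \lim_{\delta',\varepsilon\to 0}\ \liminf_{n\to\infty}\ \frac{1}{n\tau} \inf_{|y-nx|<\varepsilon n} \log \mathbb{P}\bigl[E^{(n)}_{\tau,\delta',y}(x,G')\bigr] \qquad \forall \tau > 0.
\end{aligned}
$$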
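Using the lower bound for $G' \in \mathcal{G}^{\Lambda(x)}$ and the uniformity over time of the
previous bound, by letting $\tau$ tend to 0 we deduce that (depending on $\varepsilon_0$)
there exists $\tau_0$ such that, for all $0 < \tau < \tau_0$,

$$
\lim_{\varepsilon\to 0}\ \liminf_{n\to\infty}\ \frac{1}{n\tau} \inf_{|y-nx|<\varepsilon n} \log \mathbb{P}\bigl[E^{(n)}_{\tau,\delta,y}(x,G)\bigr] \ge -H(G'\|R(x)) - \varepsilon_0.
$$

Now recall that the entropy is bounded (by continuity) when $G' \to G$, so that
there is no problem when $\delta$ decreases: for all $0 < \tau < \tau_0$,

$$
\lim_{\delta,\varepsilon\to 0}\ \liminf_{n\to\infty}\ \frac{1}{n\tau} \inf_{|y-nx|<\varepsilon n} \log \mathbb{P}\bigl[E^{(n)}_{\tau,\delta,y}(x,G)\bigr] \ge -H(G\|R(x)) - 2\varepsilon_0.
$$

Since this is true for any $\varepsilon_0$, we get the lower bound for $G$,

$$
\lim_{\tau,\delta,\varepsilon\to 0}\ \liminf_{n\to\infty}\ \frac{1}{n\tau} \inf_{|y-nx|<\varepsilon n} \log \mathbb{P}\bigl[E^{(n)}_{\tau,\delta,y}(x,G)\bigr] \ge -H(G\|R(x)).
$$

Theorem 4.1 is proved for any $G \in \bar{\mathcal{G}}^{\Lambda(x)}$.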

The uniformity of the limit stated in Theorem 4.1 is easily checked.
Note, however, that this uniformity holds only as long as $x$ stays in compact sets
of some face $\Lambda$. Indeed, if $x_{ij}$ goes to 0 for some $ij \in \Lambda$, then $\mu_{ij}(x)$ possibly
vanishes and difficulties can appear.

Proof of Theorem 1.2. Theorem 4.1 implies the large deviations
local bounds of Theorem 1.2. Moreover, if a face $\Lambda$ and a drift $D \in \mathbb{R}^{\mathcal{R}}$ are
fixed, then the limits in (1.3) in $\tau$ are uniform w.r.t. $x$ in compact sets of $\Lambda$.
The proof relies on a simple adaptation of the contraction principle, similarly
to the proof of Theorem 7.2 of Delcoigne and de La Fortelle (2002). Details
are omitted. □

5. Sample path LDP. The proof of the sample path LDP is done in two
steps which are briefly recalled. Using the Markov property, Theorem 1.2 and the
continuity of $L(x,D)$ with respect to $x \in \Lambda(D)$ for fixed $D$, large deviations
bounds are established for the probability that the process stays near some
linear path.



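Proposition 5.1 (Linear bounds). Let $x \in \mathbb{R}_+^{\mathcal{R}}$ and $D \in \mathbb{R}^{\mathcal{R}}$, satisfying
$x + DT \in \mathbb{R}_+^{\mathcal{R}}$. Denote by $\varphi$ the function such that $\varphi(t) = x + Dt$ for all $t \in [0,T]$.
Then

$$
\begin{aligned}
-I_T(\varphi)
&= \lim_{\delta,\varepsilon\to 0}\ \liminf_{n\to\infty}\ \frac{1}{n} \inf_{|y-nx|<\varepsilon n} \log \mathbb{P}\Bigl[\sup_{t\in[0,T]} |Q(nt,y) - n\varphi(t)| < \delta n\Bigr]\\
&= \lim_{\delta,\varepsilon\to 0}\ \limsup_{n\to\infty}\ \frac{1}{n} \sup_{|y-nx|<\varepsilon n} \log \mathbb{P}\Bigl[\sup_{t\in[0,T]} |Q(nt,y) - n\varphi(t)| < \delta n\Bigr].
\end{aligned}
$$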
Proof. Because the intensity $\mu_{ij}(x)$ is not bounded away
from 0, this proof is quite involved. This is where Dupuis and Ellis (1995)
use their technical uniform reachability condition; that condition does not
hold in the present model, although the final result is the same. The proof is discussed
in some detail in Section 4 and Appendix B of Delcoigne and de La Fortelle
(2001). □

From linear paths to LDP. The sample path local bounds of Theorem 1.3 
are now proved for linear paths (Proposition 5.1). There are some steps to 
reach the LDP, which we outline here. 

First, the local bounds are extended to piecewise linear paths. Using the 
Markov property, the proof looks very much like that of Proposition 5.1. 

Second, the local bounds are extended to absolutely continuous paths with
finite entropy, using the properties of $I_T(\cdot)$. Notably, points (ii) and (iv) of
Proposition 3.9 imply that for an absolutely continuous $\varphi$ with $I_T(\varphi) < \infty$,
there exists a sequence $\{\varphi_n, n \ge 1\}$ of piecewise linear paths satisfying

$$
\lim_{n\to\infty} d_c(\varphi_n,\varphi) = 0 \quad\text{and}\quad \lim_{n\to\infty} I_T(\varphi_n) = I_T(\varphi).
$$

The next step is to prove the exponential tightness of the sequence $\{n^{-1}Q(nt,[nx]),\ n \ge 1\}$
over finite time intervals (uniformly for $x$ belonging to a
compact set). This is done, for instance, in Dupuis, Ellis and Weiss (1991).
Finally, Theorem 1.3 is proved. These last two steps use various properties
of the rate function $I_T(\cdot)$ and Proposition 3.9. The reader is referred to
Section 5 of Dupuis and Ellis (1995) for details.

6. LDP without ergodicity assumption. Theorem 1.2 states large deviation
bounds for ergodic networks. However, at the expense of cumbersome
notation, it is possible to compute these bounds directly without the ergodicity
assumption by introducing a more detailed empirical generator. For ease of
exposition, the study was first performed for ergodic systems. We now show
how one can compute $L(x,D)$ in general. The discussion after Theorem 1.2
explains why the main difficulty to overcome is to compute the cost
for an arbitrary star network under the min policy to stay in a neighborhood
of 0.

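Proposition 6.1. Let $Q$ be not necessarily ergodic. For all $\tau > 0$,

$$
\begin{aligned}
&\lim_{\delta,\varepsilon\to 0}\ \inf_{|y|<\varepsilon n}\ \liminf_{n\to\infty}\ \frac{1}{n\tau} \log \mathbb{P}\Bigl[\sup_{t\in[0,n\tau]} |Q(t,y)| < \delta n\Bigr]\\
&\qquad = \lim_{\delta,\varepsilon\to 0}\ \sup_{|y|<\varepsilon n}\ \limsup_{n\to\infty}\ \frac{1}{n\tau} \log \mathbb{P}\Bigl[\sup_{t\in[0,n\tau]} |Q(t,y)| < \delta n\Bigr].
\end{aligned}
$$

The common value of these limits is denoted by $-L(0,0)$ and

$$
L(0,0) = \inf_{\nu\in V} \sum_{ij} \Bigl(\sqrt{\lambda_{ij}} - \sqrt{\mu_{ij}\nu_{ij}}\Bigr)^2 = \inf_{\nu\in V} \sum_{ij} l(0\|\lambda_{ij},\mu_{ij}\nu_{ij}),
\tag{6.1}
$$

where $l(\cdot\|\cdot,\cdot)$ is defined in (1.5) and the set $V$ by

$$
V = \Bigl\{ \nu \in \mathbb{R}_+^{\mathcal{R}} : \sum_{j\in S} \nu_{ij} \le C_i \ \ \forall i \in S \Bigr\}.
\tag{6.2}
$$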

Note that Proposition 6.1 is a bit stronger than equality (1.3) of Theorem 1.2
applied to $x = D = 0$, since the time $\tau$ is not necessarily short. Besides, the
rate function $L(0,0)$ is not explicit, but computing it is an algorithmically fairly simple
problem since it is a convex program w.r.t. $\sqrt{\nu_{ij}}$.
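
As an illustration of how $L(0,0)$ might be evaluated in practice, the following sketch approximates the infimum in (6.1) for a small star network by a brute-force grid search over $u_{ij} = \sqrt{\nu_{ij}}$. It is not the authors' method: grid search is used only because it is transparent, and any convex solver would be more efficient. The function names, the dictionary-based data layout and the numerical rates are assumptions made for the example.

```python
import itertools
import math

def l_zero(lam, mu_eff):
    """Zero-drift cost (sqrt(lam) - sqrt(mu_eff))**2 of one route, as in (6.1)."""
    return (math.sqrt(lam) - math.sqrt(mu_eff)) ** 2

def rate_at_origin(lam, mu, cap, steps=20):
    """Grid approximation (from above) of L(0,0).

    lam, mu: dicts keyed by route (i, j); cap: dict keyed by link.
    The search variable is u_ij = sqrt(nu_ij) on a regular grid.
    """
    routes = sorted(lam)
    # Taking nu_ij above lam_ij / mu_ij never lowers the cost, so u is capped there.
    u_max = [math.sqrt(lam[r] / mu[r]) for r in routes]
    grids = [[u * k / steps for k in range(steps + 1)] for u in u_max]
    best = math.inf
    for u in itertools.product(*grids):
        load = {i: 0.0 for i in cap}
        for (i, j), u_r in zip(routes, u):
            load[i] += u_r ** 2
            load[j] += u_r ** 2
        if any(load[i] > cap[i] + 1e-12 for i in cap):
            continue  # nu lies outside the set V of (6.2)
        cost = sum(l_zero(lam[r], mu[r] * u_r ** 2) for r, u_r in zip(routes, u))
        best = min(best, cost)
    return best

# Hypothetical symmetric network with three links and three routes.
routes = [(1, 2), (1, 3), (2, 3)]
lam = {r: 1.0 for r in routes}
mu = {r: 1.0 for r in routes}
print(rate_at_origin(lam, mu, cap={1: 1.0, 2: 1.0, 3: 1.0}))     # overloaded links
print(rate_at_origin(lam, mu, cap={1: 10.0, 2: 10.0, 3: 10.0}))  # ample capacity
```

For the symmetric example with unit capacities, symmetry and convexity give the exact value $3(1 - 1/\sqrt{2})^2$, which the grid result approaches from above as `steps` grows. With ample capacity the choice $\nu_{ij} = \lambda_{ij}/\mu_{ij}$ is feasible and the cost vanishes.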

6.1. Proof of Proposition 6.1. As in the ergodic case, the proof relies on
four steps: the introduction of a suitable empirical generator, the association
of a star network to each empirical generator, the proof of large deviation
bounds for the empirical generator and, finally, the proof of Proposition 6.1 using
an adaptation of the contraction principle.

6.1.1. Empirical generator. This process is slightly different from the one
defined in the ergodic case (see Definition 3.1). It covers only the
case $x = D = 0$, but in the transient case.

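Definition 6.2. The empirical generator $G_t$ is the functional defined
by

$$
G_t \stackrel{\mathrm{def}}{=} \Bigl( \frac{1}{t} A(t),\ \frac{1}{t} \int_0^t \nu(Q(s))\,ds \Bigr),
$$

where $\nu(x) = (\nu_{ij}(x),\ i,j \in S)$. The set $\Gamma$ of empirical generators is $\mathbb{R}_+^{\mathcal{R}} \times V$;
its elements will be denoted by $G = (A,\nu)$. It is equipped with the distance
$d$ defined by

$$
d(G,G') \stackrel{\mathrm{def}}{=} \sum_{ij} |a_{ij} - a'_{ij}| + \sum_{ij} |\nu_{ij} - \nu'_{ij}| \qquad \forall G, G' \in \Gamma.
$$

Large deviation bounds are established for the event [similarly to (3.2)]

$$
E^{(n)}_{\tau,\delta,y}(G) \stackrel{\mathrm{def}}{=} \Bigl\{ G_{n\tau} \in B(G,\delta),\ \sup_{t\in[0,n\tau]} |Q(t,y)| < \delta n \Bigr\},
\tag{6.3}
$$

where $B(G,\delta)$ is the ball of center $G$ and radius $\delta$. Roughly speaking, when
$\nu_{pm} = 0$ the service rate is cut on route $pm$ and so some constraints must
be imposed on $A$. More precisely: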
Lemma 6.3. Take $G = (A,\nu) \in \Gamma$. If there exist $m$ and $p$ such that

$$
\nu_{pm} = 0 \quad\text{and}\quad a_{pm} > 0,
$$

then $E^{(n)}_{\tau,\delta,y}(G)$ almost never occurs at a large deviation scale, that is,

$$
\lim_{\tau,\delta,\varepsilon\to 0}\ \limsup_{n\to\infty}\ \frac{1}{n\tau} \sup_{|y|<\varepsilon n} \log \mathbb{P}\bigl[E^{(n)}_{\tau,\delta,y}(G)\bigr] = -\infty.
$$

Proof. The proof is similar to the proof of Lemma 3.2. □

By Lemma 6.3, it is enough to deal with the following subspace of $\Gamma$.

Definition 6.4. $\mathcal{G}$ denotes the set of empirical generators $(A,\nu)$ such
that:

(i) $a_{ij} = 0$ when $\nu_{ij} = 0$,
(ii) $\sum_j \nu_{ij} < C_i$ $\forall i$.

$\bar{\mathcal{G}}$ stands for the closure of $\mathcal{G}$.

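6.1.2. Correspondence between empirical generators and star networks.
Let $G = (A,\nu) \in \mathcal{G}$. It is associated with the arrival and departure rates

$$
\tilde\lambda_{ij} = a_{ij} \qquad \forall ij \in \mathcal{R},
$$

and

$$
\tilde\mu_{ij} \stackrel{\mathrm{def}}{=}
\begin{cases}
\dfrac{a_{ij}}{\nu_{ij}}, & \forall ij \text{ such that } \nu_{ij} > 0,\\[4pt]
0, & \text{otherwise.}
\end{cases}
$$

Then $(\tilde\lambda_{ij}, \tilde\mu_{ij}\nu_{ij}(y),\ y \in \mathbb{R}_+^{\mathcal{R}})$ simply describes a star network under the min
policy where the arrival intensity and the duration of calls on route $ij$ are
respectively given by $\tilde\lambda_{ij}$ and $\tilde\mu_{ij}$.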
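
The correspondence can be written out in a few lines. The sketch below maps an empirical generator $(A,\nu)$ to the rates of its associated star network and checks the two conditions of Definition 6.4. The function names, the dictionary layout and the numbers are hypothetical, and the departure rate is taken to be $a_{ij}/\nu_{ij}$ when $\nu_{ij} > 0$, which is the choice that makes the ratio of arrival to departure rate equal to $\nu_{ij}$.

```python
import math

def associated_rates(a, nu):
    """Map an empirical generator (A, nu) to modified arrival and departure rates."""
    lam_t = dict(a)                                               # arrival rate = a_ij
    mu_t = {r: (a[r] / nu[r] if nu[r] > 0 else 0.0) for r in a}   # a_ij / nu_ij or 0
    return lam_t, mu_t

def in_generator_set(a, nu, cap):
    """Check conditions (i) and (ii) of Definition 6.4 for (A, nu)."""
    cond_i = all(a[r] == 0 for r in a if nu[r] == 0)
    load = {i: 0.0 for i in cap}
    for (i, j), v in nu.items():
        load[i] += v
        load[j] += v
    cond_ii = all(load[i] < cap[i] for i in cap)
    return cond_i and cond_ii

# Hypothetical generator on a three-link star network.
a = {(1, 2): 0.3, (1, 3): 0.2, (2, 3): 0.0}
nu = {(1, 2): 0.6, (1, 3): 0.4, (2, 3): 0.0}
cap = {1: 1.5, 2: 1.0, 3: 1.0}

lam_t, mu_t = associated_rates(a, nu)
print(in_generator_set(a, nu, cap))
for r in a:
    if mu_t[r] > 0:
        # The ratio of the modified rates recovers nu_ij.
        print(r, lam_t[r] / mu_t[r], nu[r])
```

Condition (ii) is what makes the associated network satisfy a load condition of the form $\sum_j \tilde\lambda_{ij}/\tilde\mu_{ij} < C_i$ on every link in this example.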

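Similarly to Lemma 3.4, we now prove the following lemma:

Lemma 6.5. Let $G = (A,\nu) \in \mathcal{G}$ and $\tilde{\mathbb{P}}$ the law of its associated star
network. Then $Q$ is ergodic under $\tilde{\mathbb{P}}$. Besides, for all $\tau$,

$$
\lim_{\delta,\varepsilon\to 0}\ \inf_{|y|<\varepsilon n}\ \liminf_{n\to\infty}\ \tilde{\mathbb{P}}\bigl[E^{(n)}_{\tau,\delta,y}(G)\bigr] = 1.
\tag{6.4}
$$

Proof. Since $G \in \mathcal{G}$, the ergodicity conditions (1.1) are easily checked
for $(\tilde\lambda_{ij},\tilde\mu_{ij})$, so that $Q$ is ergodic under $\tilde{\mathbb{P}}$. Moreover, a direct application
of the ergodic theorem yields

$$
\lim_{t\to\infty} \frac{1}{t} \int_0^t \nu_{ij}(Q(s))\,ds = \frac{\tilde\lambda_{ij}}{\tilde\mu_{ij}} = \nu_{ij} \qquad \forall ij.
\tag{6.5}
$$

Equation (6.4) is, thus, just a statement about fluid limits. □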

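6.1.3. Entropy and local bounds.

Definition 6.6 (Entropy). Let $G = (A,\nu) \in \bar{\mathcal{G}}$ be an empirical generator
and $(\tilde\lambda_{ij},\tilde\mu_{ij})$ its representation as a star network. The relative entropy
of $G$ with respect to $R$, the generator of the initial star network, is

$$
H(G\|R) = \sum_{ij} \bigl( I_p(\tilde\lambda_{ij}\|\lambda_{ij}) + I_p(\tilde\lambda_{ij}\|\mu_{ij}\nu_{ij}) \bigr),
$$

where $I_p$ is the entropy of Poisson processes defined in (3.6).

Proposition 6.7. Let $G = (A,\nu) \in \bar{\mathcal{G}}$ be an empirical generator. Then

$$
\begin{aligned}
-H(G\|R)
&= \lim_{\delta,\varepsilon\to 0}\ \inf_{|y|<\varepsilon n}\ \liminf_{n\to\infty}\ \frac{1}{n\tau} \log \mathbb{P}\bigl[E^{(n)}_{\tau,\delta,y}(G)\bigr]\\
&= \lim_{\delta,\varepsilon\to 0}\ \sup_{|y|<\varepsilon n}\ \limsup_{n\to\infty}\ \frac{1}{n\tau} \log \mathbb{P}\bigl[E^{(n)}_{\tau,\delta,y}(G)\bigr],
\end{aligned}
$$

where $E^{(n)}_{\tau,\delta,y}(G)$ is the event defined in (6.3).

Proof. The proof is similar to that of Theorem 4.1 and will not be repeated.
Note simply that the lower bound is first proved for $G \in \mathcal{G}$ using, in
particular, Lemma 6.5. It is then extended to all $G \in \bar{\mathcal{G}}$ using the continuity
of the entropy $H$. □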

Proof of Proposition 6.1. Details are similar to the proof of Theorem 1.2
and, thus, omitted. Note that

$$
L(0,0) = \inf_{G\in\bar{\mathcal{G}}} H(G\|R).
$$

Taking $G = (A,\nu) \in \bar{\mathcal{G}}$ and minimizing w.r.t. $A$ yields (6.1). □
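
The minimization over $A$ can be checked route by route. The sketch below assumes that the Poisson entropy of (3.6) has the usual form $I_p(a\|b) = a\log(a/b) - a + b$; under that assumption the per-route cost $I_p(a\|\lambda) + I_p(a\|\mu\nu)$ is convex in $a$, is minimized at $a = \sqrt{\lambda\mu\nu}$, and has minimum $(\sqrt{\lambda} - \sqrt{\mu\nu})^2$. The function names and the numerical values are hypothetical, and the ternary search is only one simple way to minimize a convex function of one variable.

```python
import math

def poisson_entropy(a, b):
    """Assumed form of I_p(a || b) = a*log(a/b) - a + b, with I_p(0 || b) = b."""
    if a == 0.0:
        return b
    return a * math.log(a / b) - a + b

def route_cost(a, lam, mu_eff):
    """Per-route entropy for arrival rate a; requires lam > 0 and mu_eff > 0."""
    return poisson_entropy(a, lam) + poisson_entropy(a, mu_eff)

def minimise_over_a(lam, mu_eff, iters=200):
    """Ternary search for the minimizer of the convex map a -> route_cost(a, ...)."""
    lo, hi = 0.0, max(lam, mu_eff)  # the minimizer lies between the two rates
    for _ in range(iters):
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        if route_cost(m1, lam, mu_eff) < route_cost(m2, lam, mu_eff):
            hi = m2
        else:
            lo = m1
    a = 0.5 * (lo + hi)
    return a, route_cost(a, lam, mu_eff)

lam, mu_eff = 2.0, 0.5  # hypothetical lambda_ij and mu_ij * nu_ij
a_star, value = minimise_over_a(lam, mu_eff)
print(a_star, math.sqrt(lam * mu_eff))                      # numerical vs closed-form minimizer
print(value, (math.sqrt(lam) - math.sqrt(mu_eff)) ** 2)     # numerical vs closed-form minimum
```

Summing the per-route minima over all routes and then taking the infimum over $\nu \in V$ gives an expression of the same form as (6.1).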

Taking into account Proposition 6.1, this leads to the following expression
for $L(x,D)$ for a network without ergodicity condition and for $D \in \mathbb{R}^{\mathcal{R}}$:

$$
L(x,D) = \sum_{ij\in\Lambda(x)\cup\Lambda_1(x)} l(D_{ij}\|\lambda_{ij},\mu_{ij}(x)) + \inf_{\nu\in V} \sum_{ij\in\Lambda_2(x)} l(0\|\lambda_{ij},\mu_{ij}\nu_{ij}),
\tag{6.6}
$$

where $V$ is defined in (6.2).
Remark. At the expense of heavier notation, this theorem could have
been derived at once, as in Section 4, by studying the following more detailed
empirical generator

$$
L_t = \Bigl( \frac{1}{t} A(t),\ \frac{1}{t} \int_0^t \nu_{\Lambda_2(x)}(Q(s))\,ds,\ \frac{1}{t} Q_t \Bigr),
$$

where $\nu_{\Lambda_2(x)}(y) = (\nu_{ij}(y),\ ij \in \Lambda_2(x))$, $y \in \mathbb{R}_+^{\mathcal{R}}$.